Statistical Analysis of Arabic Phonemes for Continuous Arabic Speech Recognition
نویسندگان
چکیده
Although Arabic is the world’s second most spoken language in terms of the number of speakers, Arabic automatic speech recognition (AASR) did not receive the desired attention from the research community. In this paper, we introduce thorough statistical analysis of the Arabic phonemes from a widely used Arabic corpus that was developed by King Fahd University of Petroleum and Minerals (KFUPM) with support of King Abed Al-Aziz City for Science and Technology (KACST). We study various parameters, such as the number of frames a phoneme occupies, the phonemes frequency, the mean length in frames, the standard deviation, the mode, and the median of the phoneme boundary. In addition, other language-model related information such as the bigram information is also studied. The results showed that phonemes can be clustered into groups. Based on statistical information, one can design the most suitable HMM for each phoneme in terms of the number of states and other model parameters. Keywords—Phoneme; Arabic Speech Recognition; MFCC, Mode; Median; KACST Arabic speech corpus; HMM; Acoustic Model.
منابع مشابه
Helpful Statistics in Recognizing Basic Arabic Phonemes
The recognition of continuous speech is one of the main challenges in the building of automatic speech recognition (ASR) systems, especially when it comes to phonetically complex languages such as Arabic. An ASR system seems to be actually in a blocked alley. Nearly all solutions follow the same general model. The previous research focused on enhancing its performance by incorporating supplemen...
متن کاملArabic phonemes transcription using data driven approach
The efficiency and correctness of continuous Arabic Speech Recognition Systems (ARS) hinge on the accuracy of the language phoneme set. The main goal of this research is to recognize and transcribe Arabic phonemes using a data-driven approach. We used the Hidden Markov Toolkit (HTK) to develop a phoneme recognizer, carrying out several experiments with different parameters, such as varying numb...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملClassification of the Arabic Emphatic Consonants using Time Delay Neural Network
This study concerns the use of Artificial Neural Networks (ANNs) in automatic classification of the emphatic consonants of the Standard Arabic Language (SAL). It reinforces the few works directed towards the speech recognition in Standard Arabic. We have applied the Time Delay Neural Network (TDNN) approach which permits a classification of the phonemes by taking into account the dynamic aspect...
متن کاملVocal Pathologies Detection and Mispronounced Phonemes Identification: Case of Arabic Continuous Speech
We propose in this work a novel acoustic phonetic study for Arabic people suffering from language disabilities and non-native learners of Arabic language to classify Arabic continuous speech to pathological or healthy and to identify phonemes that pose pronunciation problems (case of pathological speeches). The main idea can be summarized in comparing between the phonetic model reference to Ara...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012